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Abstract. A modern processor can dynamically set it's speed while it's 
active, and can make a transition to sleep state when required. When 



f-«) I the processor is operating at a speed s, the energy consumed per unit 

^j^ ' time is given by a convex power function P{s) having the property that 

"^ \ P(0) > and P"(s) > for all values of s. Moreover, C > units 

t~^ ■ of energy is required to make a transition from the sleep state to the 

O^ ' active state. The jobs are specified by their arrival time, deadline and 

the processing volume. 

We consider a scheduling problem, called speed scaling with sleep state, 

Q where each job has to be scheduled within their arrival time and dead- 

line, and the goal is to minimize the total energy consumption required 
C/3 ' to process these jobs. Albers et. al. [T] proved the NP-hardness of this 

^ ^^ , problem by reducing an instance of an NP-hard partition problem to an 

instance of this scheduling problem. The instance of this scheduling prob- 
lem consists of the arrival time, the deadline and the processing volume 
^ , for each of the jobs, in addition to P and C. Since P and C depend 

CO ' on the instance of the partition problem, this proof of the NP-hardness 

t^^ ' of the speed scaling with sleep state problem doesn't remain valid when 

CO , P and C are fixed. In this paper, we prove that the speed scaling with 

sleep state problem remains NP-hard for any fixed positive number C 
^+ ^ and convex P satisfying P(0) > and P"{s) > for all values of s. 

f^ . Keywords: Energy efficient algorithm, scheduling algorithm, NP-hardness 
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1 Introduction 



A modern processor can dynamically set it's speed while it's active, and can 
j^ I make a transition to sleep state when required. When the processor is operating 

at a speed s, the energy consumed per unit time is given by a convex power 
function P{s) having the property that -P(O) > and P"{s) > for all values 
of s. Therefore, some energy is consumed even if the processor is not scheduling 
any job in the active state. On the other hand, no energy is consumed when the 
processor is in the sleep state. However, C > units of energy is required to 
make a transition from the sleep state to the active state and therefore it is not 
always fruitful to go asleep when there is no work to be processed at some point 
of time. We assume that no energy is required to make a transition from the 
active state to the sleep state, as we can always include this energy requirement 
in the sleep to active state transition. A number of problems have been studied 
under this model, e.g., [I], 0, [3], g], [5], [5], [7], [S], [S], [lU], [11]. 



Fig. 1. An instance of 1$ with n = 4 



The jobs are specified by their arrival time, deadhnc and the processing 
volume. We consider a scheduling problem where each job has to be scheduled 
within their arrival time and deadline, and the goal is to minimize the total 
energy consumption required to process these jobs. Albers et. al. [T] proved the 
NP-hardness of this problem by reducing an instance of an NP-hard partition 
problem (defined below) to an instance of this scheduling problem. The instance 
of this scheduling problem consists of the arrival time, the deadline and the 
processing volume for each of the jobs, in addition to P{s) and C that depends 
on the problem instance of the partition problem. As a result, this proof of NP- 
hardness doesn't remain valid when we are given any fixed convex function P{s) 
and a positive number C. In this paper, we prove that the problem remains NP- 
hard for any fixed positive number C and convex function P satisfying P{0) > 
and P"{s) > for all values of s. 

We would do the reduction from the following NP-hard partition problem: 
Given a finite set A of n positive integers oi, 02, . . . , a„, the problem is to decide 
whether there exists a subset A' C A such that J2aieA' ^^ ~ Sai^A'*^'- -^^'^ 
assumed that Omax > 2; otherwise, the problem becomes trivial. 

2 The Reduction and it's Properties 

Let us start with a few definitions and notations. The density of an interval 
is defined as the total workload of the jobs that completely lie in an interval 
divided by the length of the interval. The critical speed s* for P{s) is defined 
as the minimum speed that minimizes — ^. Note that the critical speed is not 

well-defined if —^^ decreases monotonically. However, this is not a realistic case 
as this would mean we can schedule all jobs at an infinite speed to get the 
schedule that requires the minimum amount of energy consumption. Therefore, 
we assume that — ^ decreases for s < s* and attains the minimum at s = s*. 
Under this assumption, the following property can be easily observed easily. 

Lemma 1. P'(s) < ^^ < -^^^ for s < s* and P'(.s*) = ^^^. 

Proof. The derivative of the function — ^ is - — ^^t — —. Since — ^ decreases 

■' s s^ s 

for s < s*, we have - — ^^t^ — — < for s < s* with equality only when s = s*. 
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Given the function P{s), a non- negative number C, and an instance Ip of 
the partition problem, i.e., the integers oi, 02, . . . , a„, let's define the parameters 
which would be used to construct an instance Is of the scheduling problem. 

Lj = e - /flj where e = C/R{d),d= (1 - e)s*,0 < e < 1/2, and / = -pTg^ . 

k ^dL,~f where k = ^^^. 

B 
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The structure of 7s is the same as the one used by Albers et. al. [T]. The job 
set in Is is partitioned into three levels. In level 0, there is only one job having 
a processing volume equal to B. Level 1 comprises of n jobs; the i-th job has 
a processing volume U and Li time units to process it. There are n + 1 jobs in 
level 2, with each job having a processing volume of 5s* and 5 > Q time units to 
process it, thereby making the density of each of these jobs equal to s*. 

In the rest of this section, we establish a few lemmas that would be useful in 
our proof of NP-hardness. 

Lemma 2. R{d) < eP(0). 

Proof. Since Ris) = P{s) - ^^s, we obtain R'{s) = F'(s) - -^. We also 
note that R{0) ~ -F'(O) > and R{s*) ~ 0. Furthermore, wc obtain from Lemma 
[Dthat R'{s*) = P'{s*) - -^ = 0, and R'{s) = P'{s) - ^^ < for s < s*. 

Along with the properties established above, R"{s) = P"{s) > implies that 
the following relationship holds. 

R{d) < (1 - — )i?(0) + —R{s*) => R{d) < eP(0). 

a 

Lemma 3. i ^A^l < es * . 

Proof. It can be easily seen from Figure [2] that |i?'(d)| > ^J_^ = slope of line 
ab. Since d ~ (1 — e).s*, it follows that \j^',,l \ < es*- n 

We would now show that our choices of U and Li satisfy the trivial constraints 
li,Li > 0. We would also that the density of all the intervals except those 
corresponding to level 2 jobs are strictly less than s*. 

Lemma 4. Li, li > for all i. 

Proof. We first prove that Li > for all i. Note that 

Li>0<^e-/ai>0<^e> fui. 
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Fig. 2. Plot of s vs R{s) 



Since Umax > tti, it suffices to show tliat e > famax- As sfiown below, it can be 
easily seen using Lemma [2l C > and < e < i . 

e > fa^ax ^ ]|^ > ^ ^ m < P(0). 

In order to show that li > 0, we observe that 

k>0^dL^>^^d{e~ fai) >^ ^d> °' ^d> -— -. 

k k k[e-fai) fc(— - /) 

Since amax > at , it suffices to show that d > —, — ^ -p- . We show it below using 

Lemma [5] and Lemma |31 
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<es*(T3Y^ 



1 -e 

< (1 - e)s* = d. 

Note that the last inequality follows from our choice of < e < i . □ 




Fig. 3. Plot of X vs G{x) 



Lemma 5. The density of all the intervals except those corresponding to level 
2 jobs are strictly less than s*. 

Proof. Let's first consider the intervals corresponding to level 1 jobs. The density 
of such an interval is -^ . Observe that 



dU - ^ 



i^i i^i 



> s* ^ d < s * 
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Since d < s*, the density of an interval corresponding to a Level 1 job is strictly 
less than s*. This also proves that the density of any interval corresponding to 
the union of a proper subset of the level 1 and level 2 jobs is less than s*, since 
such jobs are non-overlapping and the density of any interval corresponding a 
level 2 job is exactly s*. 

Finally, we consider the interval that starts from the first arrival of the level 
job and lasts till it's deadline. The density of this interval is (^LXg,^ 7*' ' ■ 
This quantity is less than s* since 



The last inequality is true since d < s*. □ 



Let us introduce the functions F(x) = — ^^z + C and Hi{x) = P{j-)Li, and 
establish some of their properties. 



Lemma 6. Fix) and Hi{x) intersect at two different points for any i. 



Proof. Consider G{x) = H,{x) - F{x) = P{j-)U - ^^x - C. Note that 



G(0) = P(0)L, - C 

= P{0){e-fa,)-C 

^ C(P(0) - J^(d)) 

i?(d) 

>^^ 6P(0) ^"^ 

>0. 



The last-but-one inequaUty follows from Lemma [2] The last inequality follows 
since < e < 1/2. 

Next, we would show that G{x) decreases for x < LiS*, attains minimum at 
X — LiS* and then finally increases. Note that G'{x) = LiP'{j-)j- jp- = 

P'{j-) j^- The following inequalities follow easily from Lemma [1] and from 

the fact that P"{s) > 0. 



G'{x) < ^ P'(^) < ^^^ ^ P'(^) < P'{s*) ^x<Us*. 



By Lemma [1] the inequalities above would be strict since x < LiS*. 



We complete the proof by showing that G{LiS*) = P{s*)Li — P{s*)Li — G 

Li VL 



— C < 0. Since G is a strictly convex function (note that G"{x) = j-P"{j-)), it 



would eventually intersects the x-axis at some point x > LiS*. □ 



Lemma 7. HHk + f ) - "'^'•''Z/i'^^''^ = P'^. 



Proof. It can be easily seen that Hl{k + f) == H[{dLi) = P'(^) = P'(d). The 
foUowing calculation also shows that ' J' I — — = P'{d). 



H,{k + ^)-F{k) _ k 
ai/k a I 



,1k 
[H,{dL,) - F{k)] 



= ^[Pid)L^-^^dU + ^M^i-C] 

k , , P(s*) Ck 
= -R{d)U + -^-^ 

= ^R(d)e - -R{d).fa, + ^^^ - — 
a, ai s* tti 

= ^C + R'id) + ^^-^ 
ai s* Gi 

^p'id)-^M + EM 

= P'{d). 



U 

Let rn and ri2 be the two roots of the equation F{x) = Hiix) such that 
Til < ri2. We establish the following two lemmas. 

Lemma 8. Q < U < rn, where rn is the first intersection of Hi{x) and F{x). 

Proof. Since Hi{x) is strictly convex at every a;, we obtain 

k ai/k ai/k ai/k 

^ ~F{k) > -H,{k) 
^ G{k) > 0. 

The lemma follows since k < LiS*. O 

Lemma 9. rn < {k + ^) < ri2. 

Proof. Assume for the sake of contradiction that h + ^ < rn. It can be seen 

from FigureHthat it implies ^'^''"^^fc"^^'''' - "^^ ^ ^'^'^^ - ^^- However, 
this leads to a contradiction since d < s*, and P'{d) < — t^ by Lemma [1] On 
the other hand, it's easy to see that li + j^ — dLi < LiS* < ri2 . □ 
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Fig. 4. Plot of X vs F{x) and H{x) 



3 Proof of NP-hardness 

In this section, we would complete the proof of the NP-hardness of the speed 
scaling with power down problem. We would be using the following result by 
Irani et al. [3] along with the results derived in the previous section. 

Lemma 10. J3I There exists an optimal solution of the "speed scaling with a 
sleep state" problem that satisfies the following properties: 

— A job j must be scheduled at a constant speed Sj. 



— Suppose that the arrival time and the deadline of a job j is rj and dj , re- 
spectively. If another job k is scheduled in the interval [rj^dj], then Sk > Sj. 

— The jobs in the intervals having density at least s* are scheduled according 
to the YDS algorithm Jll}/ . The YDS algorithm is an iterative algorithm. In 
each iteration, an interval with the maximum density is identified and an 
earliest- deadline- first policy is used to construct a schedule for the jobs that 
lie completely in that interval. After an iteration, the YDS algorithm removes 
the jobs that lie completely in the maximum density interval corresponding 
to that iteration, and updates the arrival time and deadline of any job that 
overlaps with that interval. 

Theorem 1. An instance Ip of the partition problem admits a partition if and 
only if there exists a a feasible schedule for Is with total energy consumption of 

n 

at most {n + l)SP{s*) + J2 Fih) + BP'{d). 



Proof. (=>) Let's first assume that Ip admits a partition and construct a feasible 

71 

schedule of energy at most (n + l)eP(s*) + ^ F{li) + BRmin- We start with 
some notations. 



Let A' be the set of i corresponding to the solution of Partition problem, 
i.e., J2 '^i — ^^ — ■ Let bi denote the portion of the workload of Level job 

n 

scheduled in gap gi. It can be seen that ^ bi ~ B. We set the 5,;'s as follows: 

ai/k, if i G A' 
0, otherwise. 

Our schedule executes any Level 2 jobs with speed s* between it's release time 
and deadline. This is feasible since the density of any such job is equal to s*. 
Therefore, a total workload of li -\- ai/k = dLi has to be scheduled in each gap 
Qi corresponding to an i e A' . We schedule both the jobs in gap gt with speed d. 
In the rest of the gaps, the Level 1 jobs are scheduled at speed s*. The processor 
transitions to the sleep state at the completion of the job in such gaps, and 
wakes up at the release time of a Level 2 job. Since the density of any interval 
corresponding to a Level 1 job is less than s*, we get a feasible schedule. 

Let us calculate the total energy consumed by the jobs at every level. First 
of all, the total energy consumed by the Level 2 jobs is (n + l)SP{s*). In the 
gaps corresponding to i G A', wc note that the jobs are proceeded at a speed 
d = ( ' L ) ^'-'^ ^« units of time. The energy consumption in such a gap 
equals P{ '+^'/ ' )Li, which is the same as Hi{li + ^). In a gap correspond- 
ing to i ^ A' , a. total li units of workload are scheduled at speed s* and then 
the processor transitions to sleep state. Therefore, the energy consumed is given 
by P(s*)^ + C, which is the same as F{li). From lemma [71 Hi{li + ^) can be 
written asF{k) + P'{d)f. 



Let £^0,1 denote the total Energy consumed by the Level and Level 1 jobs. 
We obtain the following. 

i£A' i(fA' 



ieA' 



Y^[F{k)+P'{d)^)+Y,F.ih) 

ieA' HA' 

±Fik)+p\d)j:^^ 

n 

Y,nh)+p'{d)B. 



The last equality follows since X) ^i ~ Tl ^i/^ ^ (S o,i)l^^ = -S- Hence, 
we get a feasible schedule whose total energy consumption is (n + l)(5P(s*) + 

n 

Y.F{U) + P'{d)B. 



I,E,^^) 




sir>p-e[x) — ^lope of ah 



Fig. 5. Plot of X vs the lower envelope function LEi{x) 



(<^) In the reverse direction of the proof, we assume that Ip doesn't admits 
a partition and show that the energy consumption in any feasible schedule is 

n 

strictly greater than {n + l)5P{s*) + Y, F{k) + BP'{d). 

4=1 

Let LEi{x) = min{F{x),Hi{x)} denote the lower envelope of the functions 
F{x) and Hi{x), represented by solid curve in Figure[S] Let slope(x) denote the 
slope of the line joining {li, LEi{li)) and {x,LEi{x)). For x > Ij, LEi{x) can be 
written as LEi{x) = LEi{li) + {LEi{x) - LEi{li)) = F{li) + {slope{x)) * {x ~ li). 
We note that the slope(x) is minimum at x = li + ^ and the minimum value is 
H'{li + ^) = P' (d) (by Lemma [7]) which is independent of i. 

Consider an Optimal schedule S satisfying the properties of lemma [TOl and let 

6i , 62 ,...., 6„ units of workload of Level job be scheduled in the gaps 51 , 172 , , 5n : 

respectively. Let A' = {i\rii < k + hi < ri2}. 



Case 1. bi 



for some i E A' 



Since the workload U + bi is greater than rn and less than ri2, it is benefi- 
cial to schedule it at the speed {k + bi)/ Li (rather than to schedule it with the 
speed s*) and then transition to sleep state. From Lemma ITOl it follows that the 
ratio (li + hi) /Li must be the same for all i E A' uv the schedule S. Take i E A' 
corresponding to hi = ^. We show below that bj must also be equal to aj/k for 
all j G A' in an optimal schedule. 

(bi + li) (bj+ij) Ui/k + dLi — Ui/k bj+dLj—Qj/k 



L, 



d = d + 



hj - aj/k 



L, 



i, 



hj = aj/k. 
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Lemma[TU]says that all the intervals having density greater than or equal to s* 
must be scheduled according to YDS in the schedule S. Also, Lemma [T] tells that 
all the intervals except those corresponding to Level 2 jobs are having density 
less than s*. Therefore, in the schedule S, all Level 2 jobs must be scheduled at 
s*. Thus, the total energy consumed by the Level 2 jobs is {n + l)SP{s*). 

Let us again denote the total energy required by the Level and level 1 
jobs as -Bo,!- In a gap corresponding to i ^ A', it is optimal to schedule the 
job at speed s* and then transition to sleep state than scheduling at the speed 
{k + bi)/Li. When this is feasible, the energy consumption in the gap would 
be given by LE{li + hi). When it's not (i.e., if {h + bi)/Li > s*), the energy 
consumption in the gap would be greater than LEi{li + hi). Therefore, we obtain 
the following lower bound on ii^o,!- 



^0,1 > Y. ^-^'(^' + y ) + 5] i^.(^^ + bi) 



i£A' HA' 



= y: (Fih) + p'id)^) + Y (Fih) + ^b.) 



i£A' i^A' 



i=l i£A' i£A' 



If ^ 6i = B, it implies that X! if ~ ^^2k ' which contradicts our as- 

iGA' i£A' 

sumption that a solution of the partition problem does not exist. Therefore, 
J2 bi < B, which implies that ^ ai/k < B. The following calculation com- 

iGA' i£A' 

pletes the proof of Case 1. 

EoA > V Fik) + P'{d) V a,lk + ^^^{B - V a,/k) 
z — / z — / g^ z — / 

= ± F{k) + B^^ - iY -^/k){^^ - P'{d)) 

i=l i^A' 



>YF{h)+B^-^-BC-^-P'{d)) 

i=l 
n 

= YF^h)+BP\d) 



i=l 



Case 2. 6, 7^ ai/k for all i G A' 
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In this case, the fohowiiig calculation completes the proof. 
^0,1 = Yl (^(^») + ^lopeik + b,) *b^) + Y^ {F{1,) + slopeik + b,) * &,) 

ieA' i^A' 

> Y. F{k) + P'{d) E ^^ + E ^('^) + P'^d) Y b^ 

iGA' i^A' i^A' i^A' 

n 



a 



References 

1. S. Albers and A. Antoniadis. Race to Idle: New algorithms for speed scaling 
with a sleep state. Proceedings of the 23rd Annual ACM-SIAM Symposium on 
Discrete Algorithms, 1266-1285, 2012. 

2. E. Bampis, C. Diirr, F. Kacem and I. Mills. Speed scaling with power down 
scheduling for agreeable deadlines (submitted). 

3. P. Baptiste. Scheduling Unit Tasks to Minimize the Number of Idle Periods: 
A Polynomial Time Algorithm for Offline Dynamic Power Management. Proc. 
17th Annual ACM-SIAM Symposium on Discrete Algorithms, 364-367, 2006. 

4. P. Baptiste, M. Chrobak and C. Durr. Polynomial time Algorithms for Minimum 
Energy Scheduling. Proc. 15th Annual European Symposium on Algorithms, 
136-150, 2007. 

5. N. Bansal, H. L. Chan, T. W. Lam and K. L. Lee. Scheduling for speed bounded 
processors. Proc. 35th International Colloquium on Automata, Languages and 
Programming, 409-420, 2008. 

6. N. Bansal, H. L. Chan, K. Pruhs and D. Katz. Improved bounds for speed scaling 
in devices obeying the cube-root rule. Proc. 36th International Colloqium on 
Automata, Languages and Programming, 144-155, 2009. 

7. H. L. Chan, W. T. Chan, T. W. Lam, K. L. Lee, K. S. Mak and P. W. H. 
Wong. Optimizing throughput and energy in online deadline scheduling. ACM 
Transactions on Algorithms 6, 2009. 

8. X. Han, T. W. Lam, L. K. Lee, I. K. K. To and P. W. H. Wong. Deadline 
scheduling and power management for speed bounded processors. Theoretical 
Computer Science 411: 3587-3600, 2010. 

9. S. Irani, S.K. Shukla and R. Gupta. Algorithms for power savings. ACM Trans- 
actions on Algorithms 3, 2007. 

10. S. Irani and K. Pruhs. Algorithmic Problems in Power Management. SIGACT 
News, 36:63-76, 2005. 

11. F.F. Yao, A.J. Demers and S. Shenker. A scheduling model for reduced CPU 
energy. Proc. 36th IEEE Symposium on Foundations of Computer Science, 374- 
382, 1995. 



12 



